Search results for "Memory hierarchy"

showing 4 items of 4 documents

Optimizing the Integration Area and Performance of VLIW Architectures by Hardware/Software Co-design

2021

Cost and performance are major concerns that designers of embedded processors must take into account, especially for market reasons. To reduce cost, embedded systems rely on simple hardware architectures such as VLIW (Very Long Instruction Word) processors and depend on compiler support. This paper aims to develop a design space explorer for VLIW architectures from different perspectives, such as processing performance and integration area. A multi-objective Genetic Algorithm (GA) was used to find the optimum hardware configuration of an embedded system and the optimization rules applied by the compiler to the benchmark code. The first step consisted in represen…

Instructions per cycle, Memory hierarchy, Computer science, Population, Evolutionary algorithm, Optimizing compiler, Parallel computing, Very long instruction word, Genetic algorithm, Compiler
researchProduct

Empirical Autotuning of Two-level Parallel Linear Algebra Routines on Large cc-NUMA Systems

2012

In large cc-NUMA systems, using the different levels of the memory hierarchy efficiently is not an easy task, and the performance of multithreaded implementations of the libraries decreases as the number of cores used increases, causing a significant loss of efficiency. To alleviate this problem, routines with multilevel parallelism can be developed by combining OpenMP and BLAS parallelism. In this way, higher performance can be achieved, but an autotuning technique is needed to select the appropriate number of threads to use at each level. The selection can be made through theoretical models of the execution time or some installation methodology. This…

Task (computing), Selection (relational algebra), Memory hierarchy, Computer science, Multithreading, Linear algebra, Parallelism (grammar), Parallel computing, Temporal multithreading, Matrix multiplication
2012 IEEE 10th International Symposium on Parallel and Distributed Processing with Applications
researchProduct

Managing compressed multimedia data in a memory hierarchy: fundamental issues and basic solutions

1998

The purpose of this work is to discuss the fundamental issues and solutions in managing compressed and uncompressed multimedia data, especially voluminous continuous media types (video, audio) and text, in a memory hierarchy with four levels (main memory, magnetic disk, (optical or magnetic) on-line/near-line low-speed memory, and slow off-line memory, i.e. archive). We view the multimedia data in such a database as being generated, (compressed), and stored into the memory hierarchy (at the lowest non-archiving level), and subsequently retrieved, (decompressed), and presented. If unused, the data either travels down in the memory hierarchy or is compressed and stored at the same level. We firs…

Hardware_MEMORYSTRUCTURES, Flat memory model, Theoretical computer science, Multimedia, Memory hierarchy, Computer science, Thrashing, Memory map, Memory management, Physical address, Virtual memory, Interleaved memory
SPIE Proceedings
researchProduct

VLBI-resolution radio-map algorithms: Performance analysis of different levels of data-sharing on multi-socket, multi-core architectures

2012

A broad area in astronomy focuses on simulating extragalactic objects based on Very Long Baseline Interferometry (VLBI) radio-maps. Several algorithms in this scope simulate what the observed radio-maps would be if emitted from a predefined extragalactic object. This work analyzes the performance and scaling of this kind of algorithm on multi-socket, multi-core architectures. In particular, we evaluate a sharing approach, a privatizing approach and a hybrid approach on systems with a complex memory hierarchy that includes a shared Last Level Cache (LLC). In addition, we investigate which manual processes can be systematized and then automated in future work. The experiments sh…

Multi-core processor, Memory hierarchy, 010308 nuclear & particles physics, Computer science, General Physics and Astronomy, Parallel computing, 01 natural sciences, Scheduling (computing), Data sharing, Computer engineering, Hardware and Architecture, 0103 physical sciences, Very-long-baseline interferometry, Scalability, Cache, 010303 astronomy & astrophysics, Scaling
Computer Physics Communications, CPC, 1937-1946 (2012)
researchProduct